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Description 

Field of the Invention 

[0001] The present invention relates to a method for 5 
characterising polypeptides and to methods for identify- 
ing and assaying such polypeptides. 

Background to the Invention 

10 

[0002] The characterisation and identification of 
polypeptides from complex mixtures thereof, such as 
protein samples found in biological systems, is a well- 
known problem in biochemistry. Traditional methods in- 
volve a variety of liquid phase fractionation and chroma- is 
tography steps followed by characterisation, for exam- 
ple by two dimensional gel electrophoresis. Such meth- 
ods are prone to artefacts and are inherently slow. More- 
over, automation of such methods is extremely difficult. 
[0003] Patent Application PCT/GB97/02403, filed on 20 
5th September 1997, describes a method for profiling a 
cDNA population in order to generate a 'signature' for 
every cDNA in the population. It is assumed in that meth- 
od that a short sequence of about 8 bp that is determined 
with respect to a fixed reference point is sufficient to 25 
identify almost all genes. This system relies on Immobi- 
lising the cDNA population at the 3' terminus and cleav- 
ing it with a restriction endonuclease. This leaves a pop- 
ulation of 3' restriction fragments. The patent describes 
a technique that allows one to determine a signature of 30 
roughly 8 to 10 base pairs at a specified number of bas- 
es from the restriction site which is a sufficient signature 
to identify nearly ail genes. 

[0004] Techniques for profiling proteins, that is to say 
cataloguing the identities and quantities of proteins in a 35 
tissue, are less well developed in terms of automation 
or high throughput The classical method of profiling a 
population of proteins is by two-dimensional electro- 
phoresis. In this method a protein sample extracted from 
a biological sample is separated on a narrow gel strip. 
This first separation usually separates proteins on the 
basis of their iso-electric point The entire gel strip is 
then laid against one edge of a rectangular gel. The sep- 
arated proteins in the strip are then eiectrophoretically 
separated in the second gel on the basis of their size. *5 
This technology is slow and very difficult to automate. It 
is also relatively insensitive in its simplest incarnations. 
A number of improvements have been made to Increase 
resolution of proteins by 2-D gel electrophoresis and to 
improve the sensitivity of the system. One method to im- so 
prove the sensitivity of 2-D gel electrophoresis and its 
resolution is to analyse the protein in specific spots on 
the gel by mass spectrometry. One such method is in- 
gel tryptic digestion followed by analysis of the tryptic 
fragments by mass spectrometry to generate a peptide ss 
mass fingerprint. If sequence information is required, 
tandem mass spectrometry analysis can be performed. 
[0005] More recently attempts have been made to ex- 



ploit mass spectrometry to analyse whole proteins that 
have been fractionated by liquid chromatography or 
capillary electrophoresis. In-line systems exploiting cap- 
illary electrophoresis mass spectrometry have been 
tested. The analysis of whole proteins by mass spec- 
trometry, however, suffers from a number of difficulties. 
The first difficulty is the analysis of the complex mass 
spectra resulting from multiple ionisation states acces- 
sible by individual proteins. The second major disadvan- 
tage is that the mass resolution of mass spectrometers 
is at present quite poor for high molecular weight spe- 
cies i.e. for ions that are greater than about 4 kilodaltons 
in mass so resolving proteins that are close in mass is 
difficult A third disadvantage is that further analysis of 
whole proteins by tandem mass spectrometry is difficult 
as the fragmentation patterns for whole proteins are ex- 
tremely complex. 

[0006] Biochemistry (1995) 34, 12605-1 26 15 disclos- 
es analysis of the structural core of the human oestro- 
gen receptor ligand binding domain by cleavage by pro- 
teases, and subsequent analysis of the protein frag- 
ments by SDS-PAGE, Edman N-terminal sequncing and 
mass spectrometry (HPLC-coupled electrospray ionisa- 
tion mass spectrometry). 

Summary of the Invention 

[0007] The present invention provides a method for 
characterising polypeptides, which comprises: 

(a) treating a sample comprising a population of a 
plurality of polypeptides with a cleavage agent 
which is known to recognise in polypeptide chains 
a specific amino acid residue or sequence and to 
cleave at a cleavage site, whereby the population 
is cleaved to generate peptide fragments; 

(b) isolating a first population of peptide fragments 
which comprises only terminal peptide fragments 
bearing as a reference terminus the N-terminus or 
the C-terminus of the polypeptide from a second 
population of peptide fragments which do not com- 
prise the reference fragment each peptide fragment 
of the first population bearing at the other end the 
cleavage site proximal to the reference terminus, 
said isolation step comprising immobilisation of the 
first population of peptide fragments or immobilisa- 
tion of the second population of peptide fragments 
onto a solid support and 

(c) determining by mass spectrometry a signature 
sequence of at least some of the isolated frag- 
ments, which signature sequence is the sequence 
of a predetermined number of amino acid residues 
running from the cleavage site; 

wherein a signature sequence characterises each 
polypeptide. 
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[0008] The invention therefore describes a system 
analogous to that of PCT/GB97/02403, but for use with 
proteins. Since there are 20 monomers that make up a 
protein there are a great many more possible variants 
at a particular site in a sequence and so the length of 
signature required from a protein sequence is much 
shorter than that required from a cDNA sequence to 
identify it uniquely. 

[0009] This invention can use liquid phase separation 
techniques and mass spectrometry to resolve proteins 
and protein fragments to facilitate automation and avoid 
the artefacts and inherent slowness and lack of automa- 
tion in gel based techniques such as 2-D gel electro- 
phoresis. 

[0010] The reference terminus may be attached to a 
solid phase support to immobilise the population of 
polypeptides or peptide fragments thereof. Preferably, 
the population of polypeptides is immobilised before 
treatment with the cleavage agent In this way, the pep- 
tide fragments produced on treatment with the cleavage 
agent remain immobilised and can be readily isolated 
by washing away unwanted material present in the liquid 
phase. The solid phase support may comprise suitable 
beads or other such supports well known in this art. 
Such supports or substrates may be chosen to bind se- 
lectively to either the N-terminus or the C-terminus and 
this is discussed in further detail below. 
[001 1] In one embodiment, the reference terminus is 
attached to the solid phase support by: (i) treating the 
polypeptides with a blocking agent to block all exposed 
reference groups, which comprise either carboxyl 
groups or primary amine groups; (ii)cleaving the refer- 
ence terminal amino acids to expose unblocked refer- 
ence termini; and (iii) treating the unblocked reference 
termini with an immobilisation agent capable of coupling 
to the solid phase support; wherein step (b) comprises 
binding the treated refrence termini to the solid phase 
support and removing unbound peptide fragments. In 
an alternative embodiment, the method further compris- 
es (i) preparing the sample step (a) by pre-treating the 
polypeptides with a blocking agent to block all exposed 
reference groups, which comprise either carboxyl 
groups or primary amine groups, so that subsequent 
treatment of the sample with the cleavage agent gener- 
ates peptide fragments bearing unblocked reference 
termini; (ii) biotinylating the unblocked reference termini; 
and (iii) binding the peptide fragments containing the un- 
blocked reference termini to a solid phase support; 
wherein step (b) comprises eluting unbound peptide 
fragments therefrom. Preferably, the immobilisation 
agent comprises a biotinylation agent 
[001 2] The cleavage agent must recognise a specific 
amino acid residue or sequence of amino acids reliably. 
The cleavage site may be at the specific amino acid res- 
idue or sequence or at a known displacement therefrom. 
The cleavage agent may be a chemical cleavage agent 
such as cyanogen bromide. Preferably, the cleavage 
agent is a peptidase, such as a serine protease, prefer- 
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ably trypsin. 

As discussed in further detail below, depending on the 
number of proteins or polypeptides in a given sample, it 
may be advantageous to Sort the polypeptides into 
5 manageable sub-populations. Sorting can be effected 
before treatment of the sample with the cleavage agent 
or after cleavage. As discussed in further detail below, 
the sample of step (a) may comprise a sub-cellular frac- 
tion. In this way, the method further comprises a step of 
sub-cellular fractionation before step (a). The sample of 
step (a) may be prepared by liquid chromatography of 
either a crude fraction or a sub-cellular fraction. A pre- 
ferred method of determining the signature sequence is 
by mass spectrometry and this may be preceded by a 
high pressure liquid chromatography step to resolve the 
peptide fragments. Alternatively, the peptide fragments 
may be subjected to ion exchange chromatography be- 
fore step (c) followed by sequencing by mass spectrom- 
etry. 

[0013] In accordance with the method of the present 
invention, the predetermined number of amino acid res- 
idues required to constitute the signature sequence will 
vary according to the size of the polypeptide population. 
Preferably, the predetermined number of amino acid 
residues is from 3 to 30, more preferably 3 to 6. 
[0014] The present invention further provides a meth- 
od for identifying polypeptides in a test sample. The 
method comprises characterising the polypeptides as 
described above and comparing the signature sequenc- 
es and relative positions of the cleavage site obtained 
thereby with the signature sequences and relative posi- 
tions of the cleavage site of reference polypeptides in 
order to identify the or each polypeptide in the test sam- 
ple. This method can be used to identify a single un- 
known polypeptide or a population of unknown polypep- 
tides by comparing their characteristics (i.e. their signa- 
ture sequences and relative positions of cleavage site) 
with those of previously identified polypeptides. It is en- 
visaged that the database of such characteristic can 
readily be compiled. 

[0015] In a further aspect, the present invention pro- 
vides a method for assaying for one or more specific 
polypeptides in a test sample. The method comprises 
performing a method as described above, wherein the 
cleavage agent and relative position of the cleavage site 
is predetermined and the signature sequence is deter- 
mined in step (c) by assaying for a predetermined se- 
quence of amino acid residues running from the cleav- 
age site. Preferably, the cleavage site and signature se- 
quence are predetermined by selecting corresponding 
sequences from one or more known target polypep- 
tides, such as those available from the database. 

Brief Description of the Drawings 

[0016] The Invention will now be described in further 
detail, by way of example only, with reference to the ac- 
companying drawings, in which: 
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FIGURE 1 shows a reaction scheme according to 

one embodiment of the invention; 

FIGURE 2 shows a reaction scheme according to 

another embodiment of the invention; 

FIGURE 3 shows a reaction scheme according to 5 

a simple embodiment of the invention; and 

FIGURE 4 shows a reaction scheme according to 

a variation of the embodiment shown in Figure 1 . 

Brief Description of the Invention 

Protein Signatures: 



which are the properties used most effectively in 2-D gel 
electrophoresis. Such separations can be achieved as 
rapidly or more so using liquid chromatographic tech- 
niques. In fact, by following one liquid chromatography 
separation by another, one can resolve proteins in as 
many dimensions as one requires, since there is a great 
deal more flexibility in liquid chromatography separation 
systems, although one would ideally avoid too many 
separation steps to prevent sample loss. 
[0021] Sorting can be effected during extraction, after 
extraction of proteins from their source tissue or after 
cleavage of immobilised peptides. 
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[001 7] The essence of this system is that one can im- 
mobilise a population of proteins onto a solid phase sub- 
strate at one terminus of the molecule. Proteins are di- 
rectional so a particular terminus can be chosen in a 
manner dependent on the chemistry of the immobilisa- 
tion agent, for example the Edman reagent (phenyl iso- 
thiocyanate) can be used selectively to remove amino 
acids from the N-terminus of a protein; however, if phe- 
nyl isocyanate is used the N-terminus is simply capped. 
A derivative of this molecule that could be coupled to a 
cleavable linker on a solid-phase substrate would allow 
a protein to be immobilised at its N-terminus and sub- 
sequently removed by cleavage of the linker. During 
peptide synthesis, the C-terminus is usually immobilised 
as a benzyl ester, through the use of a chloromethyl 
group. Such chemistry may be adapted to immobilise 
proteins at this terminus, if desired. 
[0018] A population of immobilised proteins is then 
treated with a sequence specific peptidase such as 
trypsin to leave a population of N-terminal cleavage 
fragments. Such fragments can be considered to be 
analogous to an expressed sequence tag for a protein. 
One can then sequence the resultant peptide signatures 
by mass spectrometry. Terminal fragments are most 
meaningful, in that the position of all resultant peptide 
in the protein is known and the termini are usually ac- 
cessible at the surface of most proteins. 

Sorting Proteins: 

[0019] Obviously a population of proteins extracted 
from a cell is going to be a significant number of distinct 
species. If, as it is thought there are roughly 15000 
genes expressed In the average human cell, one can 
expect as many proteins. Clearly one cannot sequence 
all of these by mass spectrometry in a single step, with 
present technology. For this reason a protein population 
of such size needs to be sorted into manageable sub- 
sets. 

[0020] A generalised system for profiling proteins 
must attempt to resolve a protein population into rea- 
sonably discrete subsets of relatively uniform size. This 
is most readily achieved by separation on the basis of 
global properties of proteins, that vary over a broad and 
continuous range, such as size and surface charge, 



Sorting during cell fractionation: 

15 

[0022] , Proteins are intrinsically sorted in vivo, in terms 
of their compartmentalisation within a cell. Various tech- 
niques are available that allow one to sort proteins on 
the basis of their cellular compartments. Fractionation 

20 protocols involve various cell lysis techniques such as 
sonication, detergents or mechanical cell lysis that can 
be coupled to a variety of fractionation techniques, 
mainly centrifugation. Separation into membrane pro- 
teins, cytosolic proteins and the major membrane bound 

25 sub-cellular compartments, such as the nucleus and mi- 
tochondria, is standard practice. Thus one can effective- 
ly ignore certain classes of protein if one chooses, e.g. 
mitochondrial proteins are likely to be uninteresting in a 
lot of cases. Membrane, cytosolic and nuclear compart- 

30 ments will be of particular interest on the whole. 

Sorting after extraction: 

[0023] Since proteins are highly heterogenous mole- 
35 cules numerous techniques for separation of proteins 
are available on the basis of size, hydrophobicity, sur- 
face charge and various combinations of the above us- 
ing liquid chromatography in its various incarnations. 
Separation is effected by an assortment of solid phase 
to matrices derivitised with various functionalities that ad- 
here to and hence slow down the flow of proteins 
through the column on the basis of the properties above. 
Molecules are normally loaded Into such columns in 
conditions favouring adhesion to the solid phase matrix 
45 and selectively washed off in steadily increasing quan- 
tities of a second buffer favouring elution. In this way the 
proteins with the weakest interactions with a given ma- 
trix elute first 

[0024] Various formats for liquid chromatography ex- 
so ist but for greatest speed of throughput and for the most 
discrete separations High Pressure Liquid Chromatog- 
raphy (HPLC) formats are favoured. In this format the 
matrix is designed to be highly incompressible and when 
derivitised allow chromatographic separation to be per- 
55 formed at extremely high pressures which favours rapid 
and discrete separation. 
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Sorting of cleaved peptides: 

[0025] Liquid chromatography mass spectrometry 
(LCMS) is a well developed field. HPLC systems directly 
coupled to electrospray mass spectrometers are in 5 
widespread use. HPLC is a fast and effective way of re- 
solving peptides after they have been cleaved from their 
immobilised state. 

[0026] Alternatively sorting peptides by ion exchange 
chromatography might be advantageous, in that short *o 
peptides could be separated in an almost sequence de- 
pendent manner the amino acids that are ionisable 
have known pKa values and hence elution of peptides 
from such a column at a specific pH t would be indicative 
of the presence of particular amino acids in that se- f 5 
quence. For example, aspartate residues have a pKa of 
3.9 and gtutamate residues 4.3. Elution of a peptide at 
pH 4.3 would be indicative of the presence of glutamate 
in the peptide. These effects are sometimes masked in 
large proteins but should be distinct in short peptides, 20 
hence would be extremely useful as sorting features. 
[0027] Combination of the above techniques will allow 
various sorting protocols to be developed that will allow 
great control over the form of the protein profile gener- 
ated. In this way, identification of most proteins ex- 25 
pressed in a cell should be achievable. 

Sequencing of peptides by mass spectrometry: 

[0028] Peptides can be readily sequenced directly by 30 
tandem mass spectrometry. In general, peptide mix- 
tures are injected into the mass spectrometer by elec- 
trospray, which leaves them in the vapour phase. The 
first mass spectrometer acts as a filter selecting mole- 
cules to enter the second mass spectrometer on the ba- 35 
sis of their mass charge ratio, such that essentially only 
a single species enters the second mass spectrometer 
at a time. On leaving the first mass spectrometer, the 
selected peptide passes through a collision chamber, 
which results in fragmentation of the peptide. Since frag- 40 
mentation occurs mostly at the peptide bond, the pattern 
of fragments corresponds to a series of subspecies of 
peptides and amino acids that compose the original 
peptide. The distinct pattern of masses of single amino 
acids, 2-mers, 3-mers, etc. generated in the fragmenta- 45 
tion of the peptide is sufficient to identify its sequence. 
[0029] The end result is then that a population of pro- 
teins can be arbitrarily sorted into populations of pep- 
tides of convenient size to be fed into an electrospray 
tandem mass spectrometer for direct sequencing. Com- so 
pletion of such an analysis for an entire cell's proteins 
would give a profile of what proteins are present and In 
what relative quantities. Absolute quantitation could be 
achieved by 'spiking* a protein population with known 
quantities of particular proteins, known to be absent, e. 55 
g. plant proteins in animal samples or Wsa versa against 
which to calibrate results. 



Protein Signatures: 

[0030] This invention provides a method of capturing 
a population of proteins onto a solid phase substrate by 
one terminus of each protein in the population. This in- 
vention also provides a method of cleaving proteins that 
have been derivatised at one terminus with an agent that 
can be used to immobilise that terminus on a solid phase 
substrate. This allows a single peptide for each protein 
in a population to be captured onto a solid phase sub- 
strate thus peptides from the chosen terminus can be 
separated from other peptides generated by the cleav- 
age step and can be isolated. This invention also pro- 
vides a method to allow all the peptides generated in a 
cleavage step that are not from the reference terminus 
to be captured leaving a single terminal peptide per pro- 
tein free in solution for analysis. 
[0031] A population of peptides generated according 
to the methods of this invention can be analysed in a 
number of ways preferably by mass spectrometry. 
[0032] Two forms of analysis are preferred. The first 
is to determine peptide mass fingerprints for the popu- 
lation of signature peptides generated. In this method 
the mass of each peptide, preferably the accurate mass, 
is determined. A significant proportion of signature pep- 
tides should be uniquely identified by this form of anal- 
ysis. Any mass peaks that are unknown can be further 
characterised by the second form of preferred analysis. 
Ions of a specific mass can be selected for collision In- 
duced dissociation in a tandem mass spectrometer. This 
technique can be used to determine sequence informa- 
tion for a peptide. 

Capturing Peptides: 

[0033] This invention provides methods that exploit 
derivitisation of proteins with various agents, including 
existing peptide sequencing reagents, to isolate a single 
'signature' peptide from each member of a population of 
proteins. This invention may be practised in two formats. 
The methods of this invention allow a reference termi- 
nus to be selected from the proteins in a population. In 
the first format this reference terminus may be deriva- 
tised with an immobilisation agent. If the proteins deri- 
vatised in this manner are treated with a sequence spe- 
cific cleavage agent to generate peptides, the peptides 
from the reference termini of the proteins in a mixture 
can be specifically captured leaving the remaining pep- 
tides free in solution. This first format is discussed in the 
following section headed "Format 1" In the second for- 
mat a single peptide sample per protein is generated by 
capturing the peptide fragments that are not from the 
chosen reference terminus, thus leaving the signature 
peptides free in solution. 

Format 1: 

[0034] In the simplest embodiment of this invention as 
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shown schematically in Figure 3, a population of pro- 
teins is reacted with a modified sequencing agent spe- 
cific for one terminus of each-protein in the population. 
The modified sequencing agent carries an immobilisa- 
tion agent in order that proteins derivatised with the se- 5 
quencing agent may be captured onto a solid phase 
support The captured proteins may then be cleaved 
with a sequence specific cleavage agent This cleavage 
step will generate a series of peptide fragments in solu- 
tion and will leave a single peptide protein captured on w 
the solid phase support The peptides free in solution 
are then washed away. The immobilised peptides can 
then be released from the solid phase support by com- 
pleting the sequencing reaction for the coupled terminal 
amino acid. The Edman reagent (phenyl isothiocyanate) *s 
could be modified to carry an immobilisation agent, the 
phenyl ring could be substituted with a group linked to 
an appropriate immobilisation effector such as biotin. A 
population of proteins derivatised with this reagent could 
be cleaved with trypsin. The derivatised terminal pep- 20 
tides could then be immobilised on an avidinated solid 
phase support allowing underivatised peptides to be 
washed away. The peptides could then be released from 
the solid phase support by disrupting the avidin-biotin 
reaction. This will leave N-terminal peptides free in so- 25 
lution. These peptides can then be analysed by mass 
spectrometry. It may be desirable to fractionate the pep- 
tides prior to mass spectrometry but this fractionation 
step is optional. Alternatively a modified C-terminal se- 
quencing agent might be used to capture proteins by the 30 
C-terminus. The C-termlnus is generally not post-trans- 
lationally modified and so may be the preferred terminus 
to capture a population of proteins. Further embodi- 
ments of this invention are discussed below. 

35 

C-terminal sequencing agents: 

[0035] Unmodified C-terminal sequencing agents can 
be used to generate a signature peptide. A further em- 
bodiment of the present invention is as follows and is *o 
described schematically In Figure 1. In the first step a 
protein population extracted from a tissue is loosely im- 
mobilised onto a membrane, such as a PVDF mem- 
brane. The solvents used to extract proteins from a tis- 
sue S3mp!e are generally very harsh, usually containing 45 
agents such as urea, thiourea and detergents, since 
proteins have widely varying solubilities. Immobilising 
extracted proteins onto a membrane allows them to be 
washed with other solvents prior to modification. The 
protein population, thus captured, is then derivatised so 
with a coupling agent, such as diphenyl phosphoroi- 
sothiocyanatidate from Hewlett-Packard, (Miller et a!., 
Techniques in Protein Chemistry VI 21 9 - 227) in a meth- 
od that is essentially the same as that which one would 
use for a normal sequencing reaction for a single protein 55 
giving peptidylacylisothiocyanates for all proteins. The 
coupling reagent also reacts with other free carboxyl 
groups also giving acylisothiocyanate derivatives. The 



coupling agent may, however, react incompletely with 
some carboxylic acid side chains. It may, therefore, be 
desirable to perform additional derivitisation steps using 
more reactive reagents to ensure that all free carboxyl 
groups are derivatised. This variation is shown in Figure 
4. The derivatised protein population is then treated with 
pyridine to effecting ring closure of the terminal acyli- 
sothiocyanate derivative. One can then cleave the C- 
terminal residue by addition of a cleavage agent such 
as trimethylsilanolate, from Hewlett-Packard, which 
cleaves the terminal amino acid from each protein re- 
leasing the thiohydantoin-amlno acid derivative of the 
terminal amino acid. This exposes a free carboxyl at the 
penultimate residue of each protein. This can be specif- 
ically derivatised with biotin using 5 - (biotimamido) 
pentylamine since all other carboxyl groups are deriva- 
tised. In this way all the proteins in a population can be 
derivatised at the C-terminal with biotin. The biotinyiated 
population, still on the PVDF membrane is then treated 
with an appropriate sequence specific cleavage agent. 
Trypsin is generally used for mass spectrometry appli- 
cations as this generally leaves the N-terminal side of 
the cleavage site protonated which is desirable. Trypsin 
specifically cleaves adjacent to basic residues. If an en- 
zyme is used the immobilised peptides would have to 
be washed with some form of physiological buffer to al- 
low trypsin to function. This will leave a population of 
cleaved peptides, some of which are biotinyiated which 
can be desorbed from the PVDF membrane into solu- 
tion. The biotinyiated peptides can be captured using a 
solid phase matrix derivatised with monomeric avldln. 
Non-immobilised peptides can then be washed away, 
leaving an immobilised population of C-terminal pep- 
tides which comprise the tag used to identify proteins in 
a population. After washing away free peptides, the im- 
mobilised tags can be released from the solid phase 
support by addition of acid which disrupts the biotin/avi- 
din interaction - monomeric avidin is best for this pur- 
pose. In an alternative embodiment the biotinyiated pep- 
tides can be captured on an avidinated support prior to 
sequence specific cleavage. 

N-terminal sequencing agents: 

[0036] N-termini of a large proportion of cellular pro- 
teins are blocked. For the purposes of profiling those 
proteins whose N-termini are not blocked one can use 
the corresponding N-terminal sequencing agents to 
derivatise amino groups including the terminal amino 
group. The terminal amino acid can be cleaved and the 
newly exposed amine at the penultimate amino acid can 
be derivatised with an immobilisation agent The bioti- 
nyiated proteins can then be cleaved and the terminal 
signature peptides can be captured and analysed. This 
would however be limited to those N-termini that are not 
already blocked. 
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Format 2: 

[0037] This method is shown schematically in Figure 
2. In this method a reagent that derivatises carboxyl res- 
idues is used to cap all carboxyl residues including the s 
C-terminal carboxyl group in the protein population of 
interest. The protein population is then cleaved with 
trypsin or another sequence specific cleavage reagent 
that cleaves at the peptide bond to generate an amino 
and carboxyl group on the C-terminal and N-terminal 10 
fragments respectively. At this stage all peptides except 
the terminal peptides, which are capped will have a free 
carboxyl. These free carboxyls can be derivatised with 
5 - (biotimamido)pentylamine or some other immobili- 
sation agent. If biotin is used then one can capture all 15 
the biotinylated non C-terminal peptides onto a solid 
phase matrix derivatised with avidin. An avidinated af- 
finity column in-line with a mass spectrometer would al- 
low C-terminal peptides to be selectively eluted directly 
into the mass spectrometer for analysis. 20 
[0038] This technique is equally applicable to gener- 
ating peptide tags from the N-terminus of a population 
of proteins. Reagents which derivatise amine groups 
can be used to selectively cap all amine groups on a 
protein including the N-terminal amine group. Cleavage 25 
will expose amines in non-terminal peptides which can 
be derivatised with biotin allowing selective capture of 
non N-terminal peptides. This is important since many 
proteins are modified at the N-terminus and the N-ter- 
minal amine is often inaccessible to reagents. Thus se- 30 
lectively capturing non N-terminal peptides is a means 
of generating a signature at the N-terminus. 
[0039] The reagents to derivatise amines and carbox- 
yls are also simpler than those necessary for the cou- 
pling agents used in sequencing reactions. 35 

Immobilisation agents: 

[0040] It is possible to capture derivatised peptides 
with a variety of chemical agents. In the discussion of *o 
the methods of this invention biotin has been chosen as 
an exemplary immobilisation agent due to its highly spe- 
cific interactions with avidin. Other immobilisation 
agents besides biotin are compatible with the methods 
of this invention. The following are examples and the 45 
invention is not limited to these. 
[0041] A linker to hexahistidine would allow peptide 
tags to be captured onto a coordinated metal ion deri- 
vatised column. Various antibody antigen interactions 
could be used as well where an antibody or antigen is so 
tagged onto the penultimate amino acid rather than bi- 
otin. 

Antibodies against derivatives: 

55 

[0042] The most common N-terminal modification is 
acetylation. It should be possible to raise an antibody 
against N-terminal ly acetylated peptides to permit these 



to be captured using an affinity column derivatised with 
such an antibody. In order to capture substantially all 
proteins one can derivatise the remaining proteins in a 
sample, that are not already acetylated, with an acetyla- 
tion agent The derivatised proteins can then be cleaved 
with chymotrypsin or another sequence specific agent 
(trypsin does not cleave acetylated cleavage sites of 
proteins). An anti-N-terminal acetylation antibody immo- 
bilised on an appropriate matrix could be used to gen- 
erate an affinity column. Such a column could be used 
to capture peptide signatures with acetylated N-termini 
after their source proteins have been cleaved. 
[0043] To capture C-terminal peptides one could raise 
an antibody against thiohydantoin derivatives of pep- 
tides which could be used to selectively capture a pep- 
tide from a protein that had been derivatised with a cou- 
pling agent for sequencing prior to cleavage with trypsin 
or another sequence specific cleavage agent 

Derivltisation of proteins: 

[0044] The methods of this invention include deriviti- 
sation steps which are required to ensure that the refer- 
ence terminus of each protein in a population is specif- 
ically derivatised with an immobilisation agent in the first 
format or, in the second format, to ensure that the refer- 
ence terminus is specifically blocked from reaction with 
an immobilisation agent Additional derivitisation steps 
may also be performed. These may be desirable if frac- 
tionation of signature peptides is to be performed prior 
to mass spectrometry analysis. There are two important 
factors that should be considered with regard to any 
fractionation steps. These factors are the resolution of 
the fractionation step and the consequent sample loss 
imposed by the fractionation. 
[0045] Certain chromatographic techniques are 
'sticky' when used for the separation of peptides, that is 
to say a proportion of the sample is retained on the sep- 
aration matrix. It is possible to reduce sample loss of 
this kind by derivitising the groups that are involved in 
adhesion to the separation matrix. That is to say, if one 
is using an ion exchange chromatography separation 
one can derivatise ionic and polar side chains with rea- 
gents that increase their hydrophobicity thus reducing 
affinity to the matrix. This will, however, reduce the res- 
olution of the separation. 

[0046] It is desirable to ensure that only one mass 
peak per peptide appears in the mass spectrum gener- 
ated by analysis of a population of signature peptides. 
It may, therefore, be desirable to derivatise polar and 
ionic side chains of signature peptides in order to reduce 
the number of ionisation states accessible to those pep- 
tides. This step should help promote the formation of a 
single ion species per signature peptide. 
[0047] It may also be desirable to add a group to each 
signature peptide to increase the sensitivity of the mass 
spectrometry analysis. A particularly good 'sensitising 1 
group to add to a peptide would be a tertiary ammonium 
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ion which is a positively charged entity with excellent 
detection properties. 

Pre-Sorting Steps: 

[0046] This technology can be used to profile peptide 
populations generated in numerous ways. Various frac- 
tionation techniques exist to sub-sort proteins on the ba- 
sis of certain features. Of particular interest is the anal- 
ysis of signalling pathways. Phosphorylation of proteins 
by kinases is a feature of many signalling pathways. 
Proteins that can be phosphorylated by a kinase often 
have a short phosphorylation motif that a kinase recog- 
nises. Antibodies exist that bind to such motifs, some 
binding phosphorylated forms while others bind the non- 
phosphorylated state. Antibody affinity columns or irrv 
muno-precipitatlon of kinase target sub-populations fol- 
lowed by profiling would be of great interest in identifying 
these proteins and in monitoring their metabolism simul- 
taneously in time resolved studies of live model sys- 
tems. 

[0049] Many proteins exists as complexes and anal- 
ysis of such complexes is often tricky. A cloned protein 
that is a putative member of a complex allows one to 
generate an affinity column with that protein to trap other 
proteins that bind to it. This profiling technology is emi- 
nently suited to analysis of such captured protein com- 
plexes. 

Kits including antibody affinity columns to analyse signal 
transduction or membrane location by capturing pro- 
teins with the appropriate post-translational modifica- 
tions are envisaged either as a pre-sorting step or as a 
capture step after cleavage of a protein population with 
a sequence specific cleavage agent 

Chromatographic techniques: 

[0050] Having generated peptide tags from a popula- 
tion of proteins it is then desirable to analyse the result- 
ant tags. Chromatography is an optional step in the anal- 
ysis of a population of peptide signatures prior to mass 
spectrometry but may be quite desirable depending on 
the configuration of the mass spectrometer used. 
[0051] Two important features are required of any 
chromatographic stage in a protein profiling method, 
high resolution and minimal sample loss. Resolution 
generates information and also reduces the complexity 
of the peptide tag population entering the mass spec- 
trometer. The second feature is that there is minimal loss 
of sample in the chromatographic separation, that would 
reduce the sensitivity of the technique to low frequency 
peptides in the population under analysis. 

Derivitisation of proteins: 

[0052] Certain chromatographic techniques are 
'sticky* when used for the separation of peptides, that is 
to say a proportion of the sample is retained on the sep- 



aration matrix. To reduce sample loss of this kind Is pos- 
sible by derivitising the groups that are involved in ad- 
hesion to the separation matrix That is to say, if one Is 
using an ion exchange chromatography separation one 

5 can derivitise ionic and polar side chains with reagents 
that Increase their hydrophobic'rty thus reducing affinity 
to the matrix. This feature needs to be balanced against 
the need for resolution though. 
[0053] The use of the C-terminal sequencing agents 

10 to derivitise the free carboxyi groups which will reduce 
the adhesion between such peptides and a cation ex- 
change resin. This may mean that cation exchange 
chromatography may be advantageous as a chromato- 
graphic separation step. 

is [0054] One can derivitise quite readily acetylate 
amine residues to achieve similar effects for anion ex- 
change chromatography. 



20 



Analysis of peptides by mass spectrometry: 

tonisation Techniques: 



[0055] In general peptide mixtures are injected into 
the mass spectrometer by electrospray or MALDI TOF, 
25 which leaves them in the vapour phase. 

Electrospray lonisation: 

[0056] Electrospray lonisation requires that the dilute 

30 solution of blomolecule be 'atomised* into the spectrom- 
eter from an insertion probe, i.e. in a fine spray. The so- 
lution is, for example, sprayed from the tip of a needle 
in an electrostatic field gradient The mechanism of lon- 
isation is not fully understood but is thought to work 

35 broadly as follows. The electrostatic field charges drop- 
lets formed at the probe tip promoting atomisation. In 
the stream of nitrogen the solvent is evaporated. With a 
small droplet, this results in concentration of the biomol- 
ecule. Given that most blomolecules have a net charge 

40 this increases the electrostatic repulsion of the dis- 
solved protein. As evaporation continues this repulsion 
ultimately becomes greater than the surface tension of 
the droplet and the droplet 'explodes' into smaller drop- 
lets. The electrostatic field helps to further overcome the 

45 surface tension of the charged droplets. The evapora- 
tion continues from the smaller droplets which, in turn, 
explode iteratively until essentially the biomolecules are 
in the vapour phase, as is all the solvent 

so Atmospheric Pressure Chemical lonisation: 

[0057] An lonisation technique appropriate for use 
with LCMS, for analysing peptides is Atmospheric Pres- 
sure Chemical lonisation (APCI). This is an electrospray 
55 based technique where the lonisation chamber is mod- 
ified to include a discharge electrode which can be used 
to ionise the bath gas which In turn will collide with the 
vaporised sample molecules increasing ionisation of the 
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sample. 

Fast Atom Bombardment: 

[0058] This is an ionisation technique that is quite sim- 
ilar to APCI and is highly compatible with samples in so- 
lution. Typically, a continuous flow of liquid from a cap- 
illary electrophoresis column or an HPLC column can 
be pumped through an insertion probe to a hole or a frit 
at its tip where the solution is bombarded by accelerated 
atoms or ions, usually of xenon or caesium. Collision 
with the dissolved sample results in transfer of kinetic 
energy to and ionisation of the sample. 

Matrix Assisted Laser Desorption Ionisation (MALDI) : 

[0059] MALDI requires that the biomolecuie solution 
be embedded in a large molar excess of an photo-ex- 
citable 'matrix*. The application of laser light of the ap- 
propriate frequency (266 nm beam for nicotinic acid ) 
results in the excitation of the matrix which in turn leads 
to excitation and ionisation of the embedded biomole- 
cuie. This technique imparts a significant quantity of 
translations energy to ions, but tends not to induce ex- 
cessive fragmentation despite this. Accelerating voltag- 
es can again be used to control fragmentation with this 
technique though. 

[0060] MALDI techniques can be supported in two 
ways. One can embed proteins in a MALDI matrix, 
where the proteins themselves are not specifically ex- 
citable by laser or one can construct peptide labels that 
contain the necessary groups to allow laser energisa- 
tion. The latter approach means the labels do not need 
to be embedded in a matrix before performing mass 
spectrometry. Such groups Include nicotinic, sinapinic 
or cinnamic acid moieties. MALDI based cleavage of la- 
bels would probably be most effective with a photocleav- 
abie linker as this would avoid a cleavage step prior to 
performing MALDI mass spectrometry. The various ex- 
citable ionisation agents have different excitation fre- 
quencies so that a different frequency can be chosen to 
trigger ionisation from that used to cleave the photolys- 
able linker. These excitable moieties are easily derivi- 
tised using standard synthetic techniques in organic 
chemistry so labels with multiple masses can be con- 
structed in a combinatorial manner. 
[0061] All of the above techniques are routinely used 
with peptides and proteins and are preferred methods 
of ionisation with this invention. 

Mass Spectrometric Sensitivity and Quantitation of 
peptide tags: 

[0062] The end result is then that a population of pro- 
teins can be arbitrarily sorted into populations of pep- 
tides of convenient size to be fed into a mass spectrom- 
eter for analysis. Completion of such an analysis for an 
entire cell's proteins would give a profile of what proteins 



are present and in what relative quantities. Absolute 
quantitation could be acheived by 'spiking' a protein 
population with known quantities of particular proteins, 
known to be absent, e.g. plant proteins in animal sam- 

5 pies or visa versa, against which to calibrate results. In- 
ternal quantities can be determined by measuring rela- 
tive quantities of certain proteins present at relatively 
fixed concentrations in most cells such as histones. Var- 
ious techniques coupled to certain mass spectrometer 

10 geometries permit good quantitation with a mass spec- 
trometer. These issues are dealt with fully in QB 
9719284.3. 

Mass Analyser Geometries: 

15 

[0063] Mass spectrometry is a highly diverse disci- 
pline and numerous mass analyser configurations exist 
and which can often be combined in a variety of ge- 
ometries to permit analysis of complex organic moie- 
20 cules such as the peptide tags generated with this in- 
vention. 

Accurate Mass Measurement: 

25 [0064] Double focussing mass spectrometers are ca- 
pable of measuring molecular masses to a very high ac- 
curacy, i.e. fractions of a daiton. This permits one to dis- 
tinguish molecules with identical integer mass but dif- 
ferent atomic compositions with ease as fractional dif- 

30 ferences in the mass of different atomic Isotopes allow 
such distinctions. For determining the molecular mass- 
es of a population of peptide tags, this technique may 
be very effective as it would allow identification of a sig- 
nificant proportion of peptides without requiring any se- 

35 quencing even if some do have the same integral mass. 
The few ambiguous peptides that remain could be ana- 
lysed by tandem mass spectrometry as discussed be- 
low. 

40 Sequencing of peptide tags by Tandem mass 
spectrometry: 

[0065] Peptides can be readily sequenced by tandem 
mass spectrometry. Tandem mass spectrometry de- 

45 scribes a number of techniques in which a ions from a 
sample are selected by a first mass analyser on the ba- 
sis of their mass charge ratio for further analysis by in- 
duced fragmentation of those selected ions. The frag- 
mentation products are analysed by a second mass an- 

so alyser. The first mass analyser in a tandem instrument 
acts as a fitter selecting ions to enter the second mass 
analyser on the basis of their mass charge ratio, such 
that essentially a species of only a single mass/charge 
ratio, usually only a single peptide ion, enter the second 

55 mass analyser at a time. On leaving the first mass ana- 
lyser, the selected peptide passes through a collision 
chamber, which results in fragmentation of the peptide. 
Since fragmentation occurs mostly at the peptide bond, 
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the pattern of fragments corresponds to a series of sub- 
species of peptides and amino acids that compose the 
original peptide. The distinct pattern of masses of single 
amino acids, 2-mers, 3-mers, etc. generated in the frag- 
mentation of a peptide is sufficient to identify its se- 5 
quence. 

ION SOURCE -> MS1 -> COLLISION CELL -> MS2 -> 
ION DETECTOR 

[0066] Various tandem geometries are possible. Con- 
ventional 'sector 1 instruments can be used where the 10 
electric sector provide the first mass analyser stage, the 
magnetic sector provides the second mass analyser, 
with a collision cell placed between the two sectors. This 
geometry is not ideal for peptide sequencing. Two com- 
plete sector mass analysers separated by a collision cell 15 
could be used for peptide sequencing. A more typical 
geometry used is a triple quadrupole where the first 
quadrupole filters ions for collision. The second quadru- 
pole in a triple quadrupole acts as a collision chamber 
while the final quadrupole analyses the fragmentation 20 
products. This geometry is quite favorable. Another 
more favorable geometry is a Quadrupole/Orthogonal 
Time of Flight tandem instrument where the high scan- 
ning rate of a quadrupole is coupled to the greater sen- 
sitivity of a TOF mass analyser to identify the products & 
of fragmentation. 

Sequencing with ion Traps: 

[00671 Ion Trap mass spectrometers are a relative of 30 
the quadrupole spectrometer. The ion trap generally has 
a 3 electrode construction - a cylindrical electrode with 
'cap' electrodes at each end forming a cavity. A sinusoi- 
dal radio frequency potential is applied to the cylindrical 
electrode while the cap electrodes are biased with DC 35 
or AC potentials. Ions injected into the cavity are con- 
strained to a stable circular trajectory by the oscillating 
electric field of the cylindrical electrode. However, for a 
given amplitude of the oscillating potential, certain ions 
will have an unstable trajectory and will be ejected from *o 
the trap. A sample of ions injected into the trap can be 
sequentially ejected from the trap according to their 
mass/charge ratio by altering the oscillating radio fre- 
quency potential. The ejected ions can then be detected 
allowing a mass spectrum to be produced. <s 
[0068] Ion traps are generally operated with a small 
quantity of a 'bath gas', such as helium, present in the 
ion trap cavity. This increases both the resolution and 
the sensitivity of the device by collision with trapped 
ions. Collisions both increase ionisation when a sample so 
is introduced into the trap and damp the amplitude and 
velocity of ion trajectories keeping them nearer the cen- 
tre of the trap. This means that when the oscillating po- 
tential is changed, ions whose trajectories become un- 
stable gain energy more rapidly, relative to the damped ss 
circulating ions and exit the trap in a tighter bunch giving 
a narrower larger peaks. 

[0069] Ion traps can mimic tandem mass spectrome- 



ter geometries, in fact they can mimic multiple mass 
spectrometer geometries allowing complex analyses of 
trapped ions. A single mass species from a sample can 
be retained in a trap, i.e. all other species can be ejected 
and then the retained species can be carefully excited 
by super-imposing a second oscillating frequency on the 
first The excited ions will then collide with the bath gas 
and will fragment if sufficiently excited. The fragments 
can then be analysed further. One can retain a fragment 
ion for further analysis by ejecting other ions and then 
exciting the fragment ion to fragment This process can 
be repeated for as long as sufficient sample exists to 
permit further analysis. It should be noted that these in- 
struments generally retain a high proportion of fragment 
ions after induced fragmentation. These instruments 
and FTICR mass spectrometers (discussed below) rep- 
resent a form of temporally resolved tandem mass spec- 
trometry rather than spatially resolved tandem mass 
spectrometry which is found in linear mass spectrome- 
ters. 

[0070] For the purposes of protein profiling a peptide 
population, an ion trap is quite a good instrument. A 
sample of peptide tags can be injected into the spec- 
trometer. Peptide tags that are expected to appear in a 
profile, such as housekeeping proteins or histone pep- 
tides from eukaryote cell samples, can be ejected spe- 
cifically and quantified rapidly. The remaining peptides 
can be scanned. Totally new peptides can then be se- 
lectively retained from subsequent samples of the pep- 
tide population and can be induced to fragment allowing 
sequence data for that peptide to be acquired. Alterna- 
tively an Ion Trap can form the first stage of a tandem 
geometry instrument 

Fourier Transform ion Cyclotron Resonance Mass 
Spectrometry (FTICR MS): 

[0071] FTICR mass spectrometry has similar features 
to ion traps in that a sample of ions is retained within a 
cavity but in FTICR MS the ions are trapped in a high 
vacuum chamber by crossed electric and magnetic 
fields. The electric field is generated by a pair of plate 
electrodes that form two sides of a box. The box is con- 
tained in the field of a superconducting magnet which in 
conjunction with the two plates, the trapping plates, con- 
strain injected ions to a circular trajectory between the 
trapping plates, perpendicular to the applied magnetic 
field. The ions are excited to larger orbits by applying a 
radiofrequency pulse to two transmitter platesSvhich 
form two further opposing sides of the box. The cycloidal 
motion of the ions generate corresponding electric fields 
in the remaining two opposing sides of the box which 
comprise the 'receiver plates*. The excitation pulses ex- 
cite ions to larger orbits which decay as the coherent 
motions of the ions is lost through collisions. The corre- 
sponding signals detected by the receiver plates are 
converted to a mass spectrum by fourier transform anal- 
ysis. 
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[0072] For induced fragmentation experiments these 
instruments can perform in a similar manner to an ion 
trap - all ions except a single species of interest can be 
ejected from the trap. A collision gas can be introduced 
into the trap and fragmentation can be induced. The 
fragment ions can be subsequently analysed. Generally 
fragmentation products and bath gas combine to give 
poor resolution if analysed by FT of signals detected by 
the 'receiver plates', however the fragment ions can be 
ejected from the cavity and analysed in a tandem con* 
figuration with a quadrupole, for example. 
[0073] For protein profiling FTICR MS could be used 
and may be advantageous as these instruments have a 
very high mass resolution allowing for accurate mass 
measurement so that peptides with the same integer 
mass but different atomic compositions can be resolved. 
Furthermore unidentified peptide tags can be subse- 
quently analysed by fragmentation. 

Protein immobilisation: 

[0074] A great deal of knowledge has been accumu- 
lated about specific protein chemistries particularly in 
the area of organic synthesis of peptides. 

• R.B. Merrifield, Science 232: 341 -347, 1986. 

• S.B.H. Kent, "Chemical Synthesis of Peptides and 
Proteins", Annu. Rev. Biochem. 1988. 57: 957-989. 

Linkers: 

[0075] An important feature of this invention is cleav- 
able (inkers to their relevant biomolecules. Photocleav- 
able linkers are particularly desirable as they allow for 
rapid, reagentless cleavage. For references, see: 

• Theodora W. Greene, "Protective Groups in Organ- 
ic Synthesis", 1981, Wiley-lnterscience. 

On photoremovable groups: 

[0076] 

• Patchornik, J. Am. Chem. Soc. 92: 6333 - , 1970. 

• Amit et al, J. Org. Chem. 39: 192 - , 1974. 



ferred technique for sequencing peptides since it is a 
very soft technique and can be directly coupled to the 
liquid phase molecular biology used in this invention. 
For a full discussion of mass spectrometry techniques 
5 see: 

• K. Biemann, "Mass Spectrometry of Peptides and 
Proteins", Annu. Rev. Biochem. 1992. 61: 977 - 
1010. 

to • RAW. Johnstone and M.E. Rose, "Mass Spec- 
trometry for chemists and biochemists" 2nd edition, 
Cambridge University Press, 1996. 

Experiment 

Outline of embodiment of protein profiling 
[0079] This comprises a system where 

(i) a protein has its carboxyt groups protected, the 
last amino acid removed leaving just one carboxyl 
group tree at the cleaved terminus. 

(ii) This will be reacted with a biotinylation reagent, 
so that the carboxy terminus is labelled with biotin. 

(iii) The protein is fragmented with a protease to 
leave peptide fragments, only the carboxyl one be- 
ing biotinylated. 

[0080] The biotin is used to attach the C terminal frag- 
ment to immobilised streptavidin, or preferably mono- 
meric avidin, from which it can be released with mild acid 
and made available for MS - MS. 
[0081] All reagents are available, and the chemistry 
is generally well-known as follows: 

(i) The technique of carboxy-terminal sequencing of 
proteins is established. We note that the method of 
Boyd et al., (Boyd,, VL, Bozzini, M, Guga, PJ, De- 
Franco, RJ, Yuan, P-M, Loudon, GM and Nguyen, 
D; J. Org Chem, 60, 2581, (1995)) blocks the side 
chain carboxyls of aspartate and glutamate resi- 
dues by amidation during removal of the terminal 
amino acid. 
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Liquid Chromatography: 
[0077] 

• R. Scopes, "Protein Purification: Principles and 
Practice", Springer-Veriag, 1982. 

• M. Deutscher, "Guide to Protein Purification", Aca- 
demic Press, 1990. 

Mass Spectrometry: 

[0078] Electrospray mass spectrometry is the pre- 



(ii) Biotinylation of the free carboxyl group at the car- 
boxy terminus may be achieved using 5-(biotimami- 
do)pentylamine/1-ethyl-3-[3-dimethylaminopropyl] 
so carbodiimide hydrochloride, which is marketed by 
Pierce & Warriner (Lee, KY, Birckbichier, PJ and 
Patterson, MK, Clin Chem, 34, 906 (1988) for such 
a purpose. 

55 (Hi) Protease fragmentation of proteins on a mem- 
brane is an established technique (Sutton, CW, 
Pemberton, KS, Cottrell, JS, Corbett, JM, Wheeler, 
CH, Dunn, MJ and Pappin, DJ, Electrophoresis, 16, 
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308, (1995), and Miilipore Corporation produce Im- 
mobilon®-CD and other PVDF membranes for that 
purpose. Monomelic avidin Is produced by Pierce 
and Warriner, and allows release of biotinylated 
molecules using 2mM biotin in phosphate buffered $ 
saline. 



[0082] The remaining step In the method is the use of 
PVDF membranes (as used for trypsinisation) in lieu of 
Zitex® membranes for the sequencing reaction (I). 10 

Methodology 

Binding of lysozyme to PVDF membrane 

15 

[0083] 0.5mm squared pieces of PVDF (Miilipore) 
were wetted with isopropanoi and incubated in 20mg/ml 
lysozyme (Pharmacia) in PBS at room temperature for 
30 minutes. The membranes were then air dried and 
stored at 4oC until used. 20 

Modification (carboxyl group protection) of 
lysozyme bound to PVDF 

[0084] Modification solution was prepared by mixing 25 
62mg of 2-ethyl-5-phenylisoxazolium-3 , sulfonate 
(Aldrich) with 50ul of diisopropylethylamine (Aldrich) in 
2mls of CH 3 CN 100ul of modification solution was add- 
ed to each membrane and incubated at room tempera- 
ture for 4 hours. 30 
[0085] Following incubation 900ul of water was added 
and each membrane was gently shaken at room tem- 
perature for 30 minutes. Each membrane was then 
transferred to 50ul of CH 3 CN, 450ul of water was added 
and the membranes were gently shaken at room tern- 35 
perature for 30 minutes. 

[0086] The each membrane was then transferred to 
500ul of 2% trtfiuoroacetic acid and incubated at room 
temperature overnight 

40 

Trypsin digest 

[0087] Each membrane was transferred to 250ul of 
25mM ammonium bicarbonate pH7.6 solution and gen- 
tly shaken at room temperature for 15 minutes. 45 
[0088] Each protein/protein containing membrane 
was added/transferred to 200ul of ammonium bicarbo- 
nate solution pH7.6 containing 5ug of trypsin and incu- 
bated at 37°C overnight 

50 

Eluation of protein/peptide fragments from 
membrane 

[0089] Each membrane was transfered to 100ul of 
50% formic add/50% ethanol solution and incubated at 55 
room temperature for 30 minutes to remove the protein/ 
peptides. The membranes were then removed and 
300ul of water added to the 50% formic acid/50% etha- 



nol solution containing the protein/peptides. 
Analysis 

[0090] The following were analysed by reversed 
phase HPLC 

[0091] 40ug of trypsin In PBS; 40ug of lysozyme in 
PBS; 40ug of lysozyme digested with trypsin; 40ug of 
trypsin digested with trypsin; membrane bound modified 
lysozyme digested with trypsin; membrane put through 
the modification protocol without lysozyme and digested 
with trypsin; membrane bound lysozyme unmodified 
and digested with trypsin; membrane bound lysozyme 
modified without trypsin digestion. 

Results 

[0092] We have now performed the operation using 
PVDF membranes in lieu of Zitex® membranes for the 
sequencing reaction (I). We have found that the re- 
versed phase HPLC chromatogram for lysozyme (used 
as a typical protein) obtained after treatment with the 
sequencing reactions on a PVDF membrane and 
trypsinisation, from which the chromatogram for the 
same process in the absence of lysozyme has been 
subtracted, is similar to that obtained for lysozyme 
trypsinised directly. Hence the technologies are compat- 
ible and can be used to generate 'signature* peptides for 
MS-MS identification (data not shown). 

KEY TO THE DRAWINGS: 

FIGURE 1 

[0093] 

Step 1 : Extract proteins with harsh solvents and cap- 
ture extracted proteins onto a PVDF mem- 
brane 

Step 2: Loosely Immobilised proteins can be washed 
to dispose of harsh solvents 

Step 3: Treat proteins with C-terminal coupling agent 

Step 4: Treat derivitised proteins with cyclisation re- 
agent and then cleave terminal amino acid 
from derivitised protein 

Step 5: Biotinylate newly exposed penultimate ami- 
no acid carboxyl group 

Step 6: Wash membrane bound proteins to remove 
chemical agents and cleave proteins with 
trypsin in physiological buffer 

Step 7: Capture terminal fragments onto avidinated 
beads 

Step 8: Wash away free peptides then release cap- 
tured peptide tags' for analysis 
Step 9: Analyse by MS or LC/MS/MS or MS/MS 
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FIGURE 2 
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Stepl: 



Step 2: 

Step 3: 
Step 4: 



Step 5: 
Step 6: 

Step 7: 



Extract proteins with harsh solvents and cap- 
ture extracted proteins onto a PVDF mem- 
brane 

Loosely immobilised proteins can be washed 
to dispose of harsh solvents 
Treat proteins with C-terminat coupling agent 
Wash membrane bound proteins to remove 
chemical agents and cleave proteins with 
trypsin or other sequence specific cleavage 
agent in in physiological buffer 
Biotinylate newly exposed carboxyl termini 
Capture terminal fragments onto avidinated 
beads in an affinity column for example 
Analyse eluted C-terminal by MS or LC/MS/ 
MS or MS/MS 
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Extract proteins with harsh solvents and cap- 
ture extracted proteins onto a PVDF mem- 
brane 

Loosely immobilised proteins can be washed 

to dispose of harsh solvents 

Treat proteins with C-terminal coupling agent 

carrying immobilisation effector 

Wash membrane bound proteins to remove 

chemical agents and cleave proteins with 

trypsin in physiological buffer 

Capture terminal fragments onto avidinated 

beads 

Wash away free peptides then release cap- 
tured peptide tags' for -analysis 
Analyse by MS or LC/MS/MS or MS/MS 



Extract proteins with harsh solvents and 
capture extracted proteins onto a PVDF 
membrane 

Loosely immobilised proteins can be 
washed to dispose of harsh solvents 
Treat proteins with C-terminal coupling 
agent 

Treat coupled proteins with derivitisation re- 
agent to ensure all exposed carboxyls are 
capped 

Treat derivitised proteins with cyclisation re- 
agent and then cleave terminal amino acid 
from derivitised protein 
Biotinylate newly exposed penultimate ami- 
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no acid carboxyl group 
Step 7: Wash membrane bound proteins to remove 

chemical agents and cleave proteins with 

trypsin in physiological buffer 
Step 8: Capture terminal fragments onto avidinated 

beads 

Step 9: Wash away free peptides then release cap- 
tured peptide tags' for analysis 
Step 10: Analyse by MS or LC/MS/MS or MS/MS 
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A method for characterising polypeptides, which 
comprises : 

(a) treating a sample comprising a population 
of a plurality of polypeptides with a cleavage 
agent which is known to recognise in polypep- 
tide chains a specific amino acid residue or se- 
quence and to cleave at a cleavage site, where- 
by the peculation is cleaved to generate peptide 
fragments; 

(b) isolating a first population of peptide frag- 
ments which comprises only terminal peptide 
fragments bearing as a reference terminus the 
N-terminus orthe C-terminus of the polypeptide 
from which they were fragmented, from a sec- 
ond population of peptide fragments which do 
not comprise the reference terminus, each pep- 
tide fragment of the first population bearing at 
the other end the cleavage site proximal to the 
reference terminus, said isolation step compris- 
ing immobilisation of the first population of pep- 
tide fragments or immobilisation of the second 
population of peptide fragments onto a solid 
support, and. 

(c) determining by mass spectrometry a signa- 
ture sequence of at least some of the isolated, 
fragments, which signature sequence is the se- 
quence of a predetermined number of amino 
acid residues running from the cleavage site; 

wherein a signature sequence characterise each 
polypeptide. 

2. A method according to claim 1, wherein the refer- 
ence terminus is attached to a solid phase support 
to immobilise the population of polypeptides or pep- 
tide fragments thereof. 

3. A method according to claim 2, wherein the popu- 
lation of polypeptides is immobilised before treat- 
ment with the cleavage agent 

4. A method according to claim 2 or claim 3, wherein 
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the reference terminus is attached to the solid 
phase support by: (i) treating the polypeptides with 
a blocking agent to block all exposed reference 
groups, which comprise either carboxyt groups or 
primary amine groups; (ii) cleaving the reference 
terminal amino acids to expose unblocked refer- 
ence termini; and iii) treating the unblocked refer- 
ence termini with an immobilisation agent capable 
of coupling to the solid phase support; wherein step 
(b) comprises binding the treated reference termini 
to the solid phase support and removing unbound 
peptide fragments. 

5. A method according to claim 1 , which further com- 
prises 

(i) preparing the sample step (a) by pre-treating 
the polypeptides with a blocking agent to block 
all exposed reference groups, which comprise 
either carboxyl groups or primary amine 
groups, so that subsequent treatment of the 
sample with the cleavage agent generates pep- 
tide fragments bearing unblocked reference 
termini; 

(ii) treating the unblocked reference termini 
with an immobilisation agent capable of cou- 
pling to a solid phase support ; and 

(iii) binding the peptide fragments containing 
the unblocked reference termini to the solid 
phase support; wherein step (b) comprises 
etuting unbound peptide fragments therefrom. 

6. A method according to claim 4 or claim 5, wherein 
the immobilisation agent comprises a biotinylation 
agent 

7. A method according to any one of claims 4 to 6, 
wherein the reference group is carboxyl. 

8. A method according to any one of the preceding 
claims, wherein the cleavage agent comprises a 
peptidase. 

9. A method "according to any one of the preceding 
claims, wherein the sample of step (a) comprises a 
sub-cellular fraction. 

10. A method according to any one of the preceding 
claims, which further comprises preparing the sam- 
ple of step (a) by liquid chromatography. 

1 1 . A method according to any preceding claim, where- 
in the mass spectrometry is preceded by a high 
pressure liquid chromatography step to resolve the 
peptide fragments. 

12. A method according to any one of claims 1 to 11, 
wherein the peptide fragments are subjected to ion 



exchange chromatography before step (c). 

13. A method according to any one of the preceding 
claims, wherein the predetermined number of ami- 

5 no acid residues Is from 3 to 30. 

14. A method for identifying polypeptides in a test sam- 
ple, which comprises characterising the polypep- 
tides in accordance with a method according to any 

10 one of the preceding claims, comparing the signa- 
ture sequences and relative positions of the cleav- 
age site obtained thereby with the signature se- 
quences and relative positions of the cleavage site 
of reference polypeptides in order to identify the or 
each polypeptide in the test sample. 

15. A method for assaying for one or more specific 
polypeptides in a test sample, which comprises per- 
forming a method according to any one of claims 1 
to 14, wherein the cleavage agent and relative po- 
sition of the cleavage site is predetermined and the 
signature sequence is determined in step (c) by as- 
saying for a predetermined sequence of amino acid 
residues running from the cleavage site. 

16. A method according to claim 15, wherein the cleav- 
age site and signature sequence are predetermined 
by selecting corresponding sequences from one or 
more known target polypeptides. 



PatentansprOche 

1. Verfahren zur Charakterrsierung von Polypeptides 
umfassend: 
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(a) Behandeln einer Probe, welche eine Popu- 
lation einer Vielzahl von Polypeptiden umfasst, 
mit einem Spaltungsmittel, welches bekannter- 

40 maften in Polypeptidketten einen spezifischen 

Amlnosdurerest Oder eine Sequenz erkennt 
und an einer Schnittsteiie schneidet, wobei die 
Population geschnitten wird, urn Peptidfrag- 
mente zu erzeugen; 

45 (b) Trennen einer ersten Population von Pep- 

tidfragmenten, welche nur termlnale Peptid- 
fragmente umfasst, die als Bezugsende den 
N-Tenminus Oder den C-Terminus des Polypep- 
tids, von dem sle abgespalten wurden, tragen, 

so von einer zweiten Population von Peptidfrag- 

menten, welche nicht das Bezugsende umfas- 
sen, wobei jedes Peptidfragment der ersten 
Population am anderen Ende die Schnittsteiie 
proximal zum Bezugsende tr&gt, wobei der 

55 Trennungsschritt die Immobilisierung der er- 

sten Population von Peptidfragmente Oder die 
Immobilisierung der zweiten Population von 
Peptidfragmente auf einem festen TrSger um- 
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fasst, und 

(c) Ermrtteln einer Erkennugssequenz mittels 
Massenspektrometrie mindestens einiger Iso- 
lierter Fragmente, wobei die Erkennungsse- 
quenz die Sequenz einer vorbestimmten An- 5 
zahl von Aminosiureresten 1st, ausgehend von 
der Schnittstelle; 

wobei eine Erkennungssequenz jedes Polypeptid 
charakterisiert 10 

2. Verfahren nach Anspruch 1, wobei das Bezugsen- 
de an einen Festphasentrager angebracht ist, um 
die Popuiation von Polypeptiden oder Peptidfrag- 
menten zu immobilisieren. 15 

3. Verfahren nach Anspruch 2, wobei die Population 
von Polypeptiden vor der Behandlung mit dem 
Spaltungsmittel immobilisiert wird. 

20 

4. Verfahren nach Anspruch 2 Oder Anspruch 3, wobei 
das Bezugsende an den Festphasentrager ange- 
bracht wird, mittels: (i) Behandlung der Polypeptide 
mit einem Blockierungsmittel, um alie exponierten 
Bezugsgruppen zu blockieren, welche entweder 25 
Carboxylgruppen oder prim§re Aminogruppen um- 
fassen; (ii) Schneiden der Bezugsenden-Amino- 
siuren, um unblockierte Bezugsenden zu exponier- 
nen; und (Hi) Behandlung der unblockierten Be- 
zugsenden mit einem Immobilisierungsmittel, ge- 30 
eignetzum Koppeln an den Festphasentrager; wo- 
bei Schritt (b) die Bindung der behandelten Bezugs- 
enden an den Festphasentrager und die Entfernung 
ungebundener Peptidfragmente umfasst 

35 

5. Verfahren nach Anspruch 1 , welches weiterhin um- 
fasst: 

(i) Herstelien der Probe in Schritt (a) durch Vor- 
behandlung der Polypeptide mit einem Blockie- <o 
rungsmittel, um alle exponierten Bezugsgrup- 
pen zu blokkieren, welche entweder Carboxyl- 
gruppen oder primSre Aminogruppen umfas- 
sen, so dass eine darauffolgende Behandlung 
der Probe mit dem Spaltungsmittel Peptidfrag- <s 
mente erzeugt, welche unblockierte Bezugsen- 
den tragen; 

(ii) Behandlung der unblockierten Bezugsen- 
den mit einem Immobilisierungsmittel, geeignet 

fOr das Koppeln an einen Festphasentrager; so 
und 

(iii) Binden der Peptidfragmente, welche die un- 
blockierten Bezugsenden enthalten, an den 
Festphasentrager; wobei Schritt (b) die Elution 
von ungebundenen Peptidfragmenten um- 55 
fasst. 

6. Verfahren nach Anspruch 4 oder 5, wobei das Im- 
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mobilisierungsmittel ein Biotinylierungsmittel um- 
fasst. 

7. Verfahren nach irgendeinem der AnsprOche 4 bis 
6, wobei die Bezugsgruppe eine Carboxylgruppe 

ist 

8. Verfahren nach irgendeinem der vorangehenden 
AnsprOche, wobei das Spaltungsmittel eine Pepti- 
dase umfasst 

9. Verfahren nach irgendeinem der vorangehenden 
AnsprOche, wobei die Probe aus Schritt (a) eine 
subcellular Fraktion umfasst 

10. Verfahren nach irgendeinem der vorangehenden 
AnsprOche, welches weiterhin die Herstellung der 
Probe aus Schritt (a) mittels FIQssigkeitschromato- 
graphie umfasst 

11. Verfahren nach irgendeinem der vorangehenden 
AnsprOche, wobei der Massenspektrometrie eine 
HochleistungsflOssigkeitschromatographie voraus- 
geht, um die Peptidfragmente aufzuldsen. 

12. Verfahren nach irgendeinem der Anpruchel bis 11, 
wobei die Peptidfragmente vor Schritt (c) einer lo- 
nenaustauschchromatographie unterworfen wer- 
den. 

13. Verfahren nach irgendeinem der vorangehenden 
AnsprOche, wobei die vorbestimmte Anzahl von 
AminosSureresten von 3 bis 30 reicht 

14. Verfahren zur Identifizierung von Polypeptiden in ei- 
ner Testprobe, umfassend die Charakterisierung 
der Polypeptide in Oberefnstimmung mit einem Ver- 
fahren nach irgendeinem der vorangehenden An- 
sprOche, den Vergleich der Erkennungssequenzen 
und der relativen Positionen der dadurch erharte- 
nen Schnittstelle mit den Erkennungssequenzen 
und relativen Positionen der Schnittstelle von Be- 
zugspolypeptiden, um die Polypeptide oder jedes 
Polypeptid in der Testprobe zu identifizieren. 

15. Verfahren zur Untersuchung einer oder mehrerer 
spezifischer Polypeptide in einer Testprobe, umfas- 
send die DurchfGhrung eines Verfahrens nach ir- 
gendeinem der AnsprOche 1 bis 14, wobei das Spal- 
tungsmittel und die relative Position der Schnittstel- 
le vorbestimmt Ist, und die Erkennungssequenz in 
Schritt (c) durch Untersuchung auf eine vorbe- 
stimmte Sequenz oder AminosSurereste hin, aus- 
gehend von der Schnittstelle, bestimmt wird. 

16. Verfahren nach Anspruch 15, wobei die Schnittstel- 
le und die Erkennungssequenz mittels Auswihlen 
korrespondierender Sequenzen aus einem Oder 
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mehreren bekannten Zielpolypeptiden vorbestimmt 
werden. 



Revendications 5 

1. Procede pour la caracterisation de polypeptides, 
comprenant : 

(a) le traitement d'un echantillon contenant une 10 
population fbrmee d'une plurality de polypepti- 
des avec un agent de clivage dont on sait qu'il 
reconnait dans les chaTnes de polypeptides un 
residu ou une sequence d'acide amine specrfi- 
que et realise le clivage sur un site de clivage, * § 
de facon a diver la population pour produire 
des fragments de peptide ; 

(b) I'isolement d'une premiere population de 
fragments peptidiques composee uniquement 

de fragments peptidiques terminaux portant 20 
comme partie terminate de reference ia partie 
N-terminale ou la partie C-terminale du poly- 
peptide dont lis ont ete extraits par fragmenta- 
tion, par rapport a une seconds population de 
fragments peptidiques qui ne contiennent pas 25 
la partie terminals de reference, chaque frag- 
ment peptidique de la premiere population por- 
tant a I'autre extremite le site de clivage proxi- 
mal a la partie terminale de reference, ladite 
etape d'isolement comprenant ('immobilisation 30 
de la premiere population de fragments pepti- 
diques ou rimmobilisation de la seconde popu- 
lation de fragments peptidiques sur un support 
solide, et 

(c) la determination par spectrometrie de mas- 35 
se d'une sequence de signature de certains au 
moins des fragments isoles, laquelle sequence 

de signature est la sequence d'un nombre pre- 
determine de residus decide amine a partir du 
site de clivage ; <o 

dans lequel une sequence de signature ca- 
racterise chaque polypeptide. 

2. Precede selon la revendication 1, caract6rise en 45 
ce que la partie terminale de reference est fix6e a 

un support en phase solide afin d'immobiliser la po- 
pulation de polypeptides ou les fragments peptidi- 
ques de ceux-ci. 

so 

3. Precede selon la revendication 2, caracterise en 
ce que la population de polypeptides est immobili- 
ses avant le traitement par i'agent de clivage. 

4. Procede selon la revendication 2 ou 3, 55 
caracterise en ce que la partie terminale de refe- 
rence est fixee au support en phase solide par : 



(i) traitement des polypeptides avec un agent 
de blocage pour bloquertous les groupements 
de reference exposes, qui comprennent soit 
des groupements carboxyle, soit des groupe- 
ments amine primaires ; 

(ii) clivage des acides amines terminaux de re- 
ference de facon a exposer les parties termina- 
les de reference non bloquees ; et 

(iii) traitement des parties terminates de refe- 
rence non bloquees avec un agent demobili- 
sation pouvant dtre couple au support en phase 
solide ; 

dans lequel I'etape (b) comprend la liaison 
des parties terminates de reference traitees au sup- 
port en phase solide et ['elimination des fragments 
peptidiques non lies. 

5. Procede selon la revendication 1, comprenant en 
outre: 

(i) la preparation de I'echantillon de i'etape (a) 
par pretraitement des polypeptides avec un 
agent de blocage de facon a bloquer tous les 
groupements de reference exposes, qui com- 
prennent soit des groupements carboxyle, soit 
des groupements amine primaires, de telle sor- 
ts que le traitement subsequent de I'echantillon 
avec I'agent de clivage produise des fragments 
peptidiques portant des parties terminales de 
reference non bloquees ; 

(ii) le traitement des parties terminales de refe- 
rence non bloquees avec un agent ^immobili- 
sation pouvant §tre couple a un support en pha- 
se solide ; et 

(Iii) la liaison des fragments peptidiques conte- 
nant les parties terminales de/eference non 
bloquees au support en phase solide ; 

caracterise en ce que I'etape (b) comprend 
i'eiution des fragments peptidiques non lies. 

6. Procede selon la revendication 4 ou la revendica- 
tion 5, caracterise en ce que I'agent ^immobilisa- 
tion comprend un agent de biotinylation. 

7. Procede selon Tune quelconque des revendications 
4 a 6, caracterise en ce que le groupement de re- 
ference et un groupement carboxyle. 

8. Procede selon Tune quelconque des revendications 
precedentes, caracterise en ce que I'agent de cli- 
vage contient une peptidase. 

9. Procede selon Tune quelconque des revendications 
precedentes, caracterise en ce que I'echantillon 
de retape (a) contient une fraction subcellular. 
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1 0. Precede selon i'une quelconque des revend ications 
pr6c6dentes, caract6ris6 en ce qu'i! comprend en 
outre la preparation de P6chantillon de Tetape (a) 
par chromatographie en phase liquide. 

5 

1 1 . Precede selon I'une quelconque des revendications 
precedentes, caracterls6 en ce que la spectreme- 
trie de masse est precedee d'une etape de chroma- 
tographie en phase liquide sous haute pression 
pour la resolution des fragments peptidiques. 10 

1 2. Precede selon Tune quelconque des revendications 
1 a 1 1 , caracteris6 en ce que les fragments pepti- 
diques sont soumis a une chromatographie par 
echange d'ions avant Tetape (c). is 

1 3. Precede selon Tune quelconque des revendications 
precedentes, caracterls£ en ce que le nombre pre- 
determine de r6sidus d'acides amines est compris 
entre 3 et 30. 20 

14. Precede pour ('identification de polypeptides dans 
un echantillon a tester, comprenant la caracterisa- 
tion des polypeptides par un precede selon Tune 
quelconque des revendications precedentes, la 25 
comparaison des sequences de signature et des 
positions relatives du site de clivage ainsl obtenues 
avec les sequences de signature et la position re- 
lative du site de clivage de polypeptides de referen- 
ce afin d'identifier le polypeptide ou chaque poly- 30 
peptide dans rechantilion a tester. 

15. Precede pour la recherche d'un ou plusleurs poly- 
peptides specifiques dans un echantillon a tester, 
comprenant Pex6cution d'un precede selon Tune 35 
quelconque des revendications 1 a 14, caract6ris6 

en ce que I'agent de clivage et la position relative 
du site de clivage sont predetermines et la sequen- 
ce de signature est d6termin6e dans retape (c) par 
recherche d'une sequence pred6termin6e de r6si- 40 
dus d'acides amines a partir du site de clivage. 

16. Precede selon la revendication 15, caracterise en 
ce que le site de clivage et la sequence de signa- 
ture sont predetermines par la selection de sequen- 45 
ces correspondantes parmi un ou plusieurs poly- 
peptides cibles connus. 
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FIGURE 1 
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FIGURE 1 (continued) 
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Step 8 



Step 9 



FIGURE 1 (continued) 
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FIGURE 2 
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Step 6 



FIGURE 2 (continued) 
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FIGURE 2 (continued) 
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FIGURE 3 
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FIGURE 3 (continued) 
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FIGURE 4 
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FIGURE 4 {continued) 
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Step 10 
FIGURE 4 (continued) 
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